Using citations to facilitate precise indexing and automatic index creation in collections of research papers
Authors
Abstract
We describe Rosetta, a digital library system for scientific literature. Rosetta makes it easy for people to find the information for which they are looking even when using short, imprecise queries. Rosetta indexes research articles based on the way they have been described when cited in other documents. The concise descriptions that occur in citations are similar to the short queries people typically form when searching; therefore, citations may make a better basis for indexing than do the words used within a research article itself. Using this indexing technique we are able to provide a user interface that presents users with an automatically generated directory of the information space surrounding a query. Our objective with this interface is to present people with the information for which they have asked as well as the information for which they may have intended to ask.
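The indexing idea in the abstract can be illustrated with a minimal sketch. It is not Rosetta's actual implementation, which the abstract does not detail. It assumes citation contexts are already available as (cited document, citing text) pairs. The names `tokenize`, `build_citation_index`, and `search`, the term-count scoring, and the sample data are all hypothetical.

```python
from collections import defaultdict
import re


def tokenize(text):
    """Lowercase the text and split it into alphanumeric terms."""
    return re.findall(r"[a-z0-9]+", text.lower())


def build_citation_index(citations):
    """Build an inverted index from the words other documents use when citing.

    citations: iterable of (cited_doc_id, citing_text) pairs (assumed input).
    Returns a mapping term -> {cited_doc_id: occurrence count}.
    """
    index = defaultdict(lambda: defaultdict(int))
    for cited_id, text in citations:
        for term in tokenize(text):
            index[term][cited_id] += 1
    return index


def search(index, query):
    """Rank documents by how often the query terms occur in citations to them."""
    scores = defaultdict(int)
    for term in tokenize(query):
        for doc_id, count in index.get(term, {}).items():
            scores[doc_id] += count
    # Highest score first; ties broken by document id for a stable order.
    return sorted(scores, key=lambda d: (-scores[d], d))


# Hypothetical citation contexts, invented for illustration only.
citations = [
    ("paper_A", "a probabilistic model for part-of-speech tagging"),
    ("paper_A", "the tagging model of [3]"),
    ("paper_B", "an efficient algorithm for chart parsing"),
]
index = build_citation_index(citations)
results = search(index, "tagging model")
```

Here a short query such as "tagging model" matches `paper_A` through the words other authors used to describe it, regardless of the vocabulary in the article's own text.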
Similar resources
Automatic keyword extraction using Latent Dirichlet Allocation topic modeling: Similarity with golden standard and users' evaluation
Purpose: This study investigates the automatic keyword extraction from the table of contents of Persian e-books in the field of science using LDA topic modeling, evaluating their similarity with golden standard, and users' viewpoints of the model keywords. Methodology: This is a mixed text-mining research in which LDA topic modeling is used to extract keywords from the table of contents of sci...
A two-stage break-and-select model for automatic indexing of Persian texts
Purpose: Each language has its own problems, so appropriate models for automatic indexing must be considered for each language. These models should address the exhaustivity and specificity of indexing. This paper introduces and evaluates a model suited to automatic indexing of Persian. This model suggests to break the text into the particles of candidate terms and to c...
Robust Audio Indexing for Dutch Spoken-word Collections
Whereas the growth of storage capacity is in accordance with widely acknowledged predictions, the possibilities to index and access the archives created are lagging behind. This is especially the case in the oral history domain, and much of the rich content in these collections runs the risk of remaining inaccessible for lack of robust search technologies. This paper addresses the history and develo...
A study of the citation impact of articles in Iranian Persian-language scientific-research journals
Purpose: We have studied in the present research the citation impact of the papers published in Iranian Farsi scientific-research journals in the three fields of medical sciences, agricultural sciences, and human sciences, which have been indexed in SID citation database. Methodology: This study used citation analysis. The population under study included all papers with two or more citations i...
Manipulating Google Scholar Citations and Google Scholar Metrics: simple, easy and tempting
The launch of Google Scholar Citations and Google Scholar Metrics may cause a revolution in the research evaluation field as it places within every researcher’s reach tools that allow them to measure their output. However, the data and bibliometric indicators offered by Google’s products can be easily manipulated. In order to alert the research community, we present an experiment in which we ma...
Journal title:
- Knowl.-Based Syst.
Volume 14, Issue
Pages -
Publication date 2001